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The  Fortran  Deductive  System 

J.  R*  Quinlan  and  E‘.  B.  ITunt 
The  University  of  Washington 

The  Fortran  Deductive  System  (FSD)  i3  a  program  which 
solves  generalized  theorem  proving  problems.  It  receives 

■j 

a#  input  the  dofinS-tion  of  e  deductive  system  (algebra, 
geometry,  first  order  predicate  calculus,  etc0)s  then 
series  of  theorems.  The  program  attests  to  prove  each 
theorem,  using  as  premises  the  axioms  of  the  system  and  any 
previously  proven  theorems  which  the  pregram  use  ;  has  ordered 
it  to  ret* lrio  The  notation  used  by  the  program  to  define 
a  particular  deductive  system  is  very  similar  to.  that  used 
in  conventional  school  mathematics*  This  makes  it  possible 
for  a  person  to  ueo  the  system  ovan  though  he  has  r.o  Yiov - 
ledge  of  FDS*  internal  structure*  In  fact,  most  i^srs 
the  system  do  not  have  this  knowledge , 

The  FDS  was  developed  as  a  vehicle  for  studying  two 
problems  in  computer  science,  both  of  which  are  closely 
related  to  analagous  problems  in  the  behaviorcl  sciences. 

The  major  use  of  FDS  has  been  to  study  "pure  cases"  of 
problem  solving  processes*  By  observing  the  performance 
of  well  defined  algorithms  for  solving  symbolic  problems, 
we  hope  to  develop  an  empirical  typology  of  problem  typoo 
such  that,  knowing  the  type  of  a  particular  problem,  we 
will  be  able  to  predict  the  appropriate  method  for  solving 
the  problem  before  wo  do,  in  fact  solve  it.  A  soraewhat 
simpler  statement  of  this  goal  is  that  we  wsiit  to  know  whm 
certain  problem  solving  algorithms  vork  well.,  A  second 
research  goal  is  the  augmentation  of  human  problem  solving 


ability e  It  is  obvious  that  if  wa  could  produce  a  theorem 
prover  more  effective  than  man,  the  range  of  problems  which 
could  be  attacked  would  be  increased »  In  addition,  by  ex¬ 
ploring  the  potentials  of  particular  computer-executed  problem 
solving  algorithms  we  may  assist  human  problem  solvers 
in  two  other  ways c  We  might  discover  problem  solving  methods 
which,  in  addition  to  being  jtsf.  china  executable,  could  ba 
taught  to  humans  with  a  resultant  increase  in  their  intel¬ 
lectual  skills  c  By  extending  the  range  of  problems  tv'hich 
computers  can  handle,  we  can  increase  the  potential  oontri- 
but  ion,  and  altar  idle  work  distribution,  in  men-machino 
problem  solving  teams. 

The  FDS  is  an  intellectual  descendant  of  the  General 
Problem  Solver  (GPS)  of  Newell,  Shaw  and  SimciV  (1959;  Newell 
and  Simon,  1961),  from  whom  we  obtained  the  original  idea 
and  an  orientation  for  our  early  efforts t  By  now,  however, 
the  relationship  has  become  quite  distant,  an  FDS  incorporates 
several  concepts  and  programming  techniques  ^fhich  are  not 
contained  in  GPS,  and  similarly,  some  of  the  GPS  capabi¬ 
lities  have  been  dropped v  FDS  also  incorporates  many  ideas 
taken  from  recent  work  on  the  construction  of  generalized 
compilers  ("compiler-compilers”) .  As  its  namo  implies,  tho 
program  is  written  entirely  ir  FORTRAN  IV.  ALGOL  versions 
have  also  been  implemented.  We  have  not  found  any  substantial 
restrictions  due  to  limitations  of  these  languages . 

A  formal  definition  of  FDS  is  available  (Quinlan  and 
Hunt,  1967).  It,  and  program  listings  if  desired,  can  be 
obtained  from  us.  The  program  has  been  running  successfully 
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over  a  year#  To  our  knowledge,  runs  have  been  executed 
on  the  IBM  7094,  the  CDC  3600,  the  Burroughs  B  5500,  and  the 
English  Electric  KDF-9*  In  this  paper  we  shall  give  an 
informal  description  of  the  general  features  of  the  program, 
and  discuss  a  few  selected  applications  to  give  some  idea 
of  its  range  and  power* 

From  the  user's  point  of  view 

The  user  can  regard  FDS  as  an  "alterable"  black  box 
which  accepts  theorems  as  input  and  produces  solutions  as 
output*  It  is  alterable  in  the  sense  that  prior  to  the  first 
theorem  the  user  specifies  what  a  well  formed  expression 
looks  like,  what  notation  he  will  us®,  and  whst  rules  of 
inferences  are  permissible* 

Notation;  The  user  firit  indicates  the  names  of  his  operands 
and  operators.  Re  can  specify  non-numeric  constants  (TRUE, 
FALSE,  GOOD,  BAD),  variables  (X,  Y,  WHO,  WHICH),  unary 
connectives  (NEG,  NOT,  LOG,  BELOW),  and  binary  connectives 
(+,  NOT,  OR,  BELIEVES) c  In  addition,  the  program  recog¬ 
nizes  the  integers* 

Axioms;  Excepting  the  integers,  the  program  contains  no 
inherent  definition  for  the  user's  notation.  Meaning  i3 
established  when  the  user  defines  his  axioms  „  Each  axion 
is  stated  as  a  rewriting  rule,  i*e,,  structure-  A  may  be 
rewritten  as  structure  B.  We  will  use  ":=r"  to  mean  "may 
be  rewritten  as,"  although  in  fact  this  symbol  may  bo  speci¬ 
fied  by  the  user*  Axioms  are  presented  to  the  system  in 
the  conventional,  or  infix,  notation*  For  example,  the 
comoutivity  property  of  in  algebra  would  be  expressed 


as 

A  +  3  r=  3  -5-  A, 

This  rule  informs  the  Systran  that  any  two  well  formed  formulae 
separated  by  a  may  be  rewritten  with  the  .order  of  the 
formulae  reversed*  As  in  conventional  a3-gebrsu  e  well  formed 
formula  is  any  variable  or  constant  ,  any  unary  operated 
followed  by  a  well  formed  formula,  or  any  two  well  formed 
formulae  separated  by  a  binary  operator*  Parentheses  roust 
be  used  to  supply  information  concerning  the  order  of 
priority  of  operators  where  this  is  ambiguous,  in  the  same 
manner  as  they  are  used  in  school  algebra* 

System  Parameters „  As  an  option,  the  user  tray  supply  the 
program  with  sorto  "hints*1  about  the  difficulty  of  a  problem, 
ir.  terras  of  the  expected  number  of  steps  to  a  proof,  how 
large  the  expressions  should  grow  to  in  intermediate  stages 
of  proof,  and  so  forth*  These*  parameters  may  affect  the 
power  of  the  system  in  some  situations  *  in  others  its  per¬ 
formance  is  surprisingly  independent  cf  parameter  settings* 
Problems;  Problems  are  input  in  exactly  the  same  form  as 
axioms,  i*e*,  a  well  formed  formula  on  the  left  to  bo  re¬ 
written  as  on  the  right  of  a  rewriting  symbol*  Optionally, 
the  user  may  indicate  that  if  g  particular  theorem  is 
proven,  it  is  to  be  used  in  the  proof  of  subsequent  theorems* 

A  theorem  proving  problem  is  solved  by  FDS  when  the 
program  finds  an  ordered  set  of  rewritings  which  change  one 
string  of  symbols  (the  " left-hand  side"  above)  .into  another 
(the  "right-hand  side81),,  Ai*  sn  example,  suppose  \m  have 


the  axioms 
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# 

1*  A  +  B  :«  B  +  A 

2«  (A  3)  t  C  t*  A  +  (B  v  C) 
and  the  problem 

A  +  (B  f  C)  ;*  (A  +  B)  +  C. 

«/ 

This  is  a  problem  because  the  rewriting  symbol  is  not  reflexive 
•  it  acts  more  like  .the  **  implies"  of  logic  than  the 

•»*  of  algebra*  The  following  solution  would  be  obtained 
by  the  FDS: 

INITIALLY  A  (B  +  C) 

USING  AXIOM  1  (B  *  C)  +  A 

USING  AXIOM  1  (C  *  B)  +  A 

USING  AXIOM  2  C  ->  (B  +  A> 

USING  AXIOM  1  (B  +  A)  +  C 

USING  AXIOM  1  (A  *  B)  b  C  SOLUTION 

This  simple  example  illustrates  two  points.  Nowhere 
has  the  program  gore  beyond  tho  semantic  meaning  of  the 
symbols  Implied  in  the  axioms,  Tho  "cleverness"  of  the 
program  is  solely  in  deciding  which  axioms  to  apply*  and 
whether  to  apply  them  to  the  entire  string  of  symbols  which 
constitute  the  present  state-*  each  successive  If  jig  of  tho 
proof— or  to  some  cubstring  of  the  present  state. 

£E2saa-Sfess£isfi£Sea 

The  FDS  internal  operation  will  now  be  described 
briefly.  Internally,  formulae  are  expressed  in  tho  suffix., 
or  "Polish"  notation- v  A  well  formed  formula  is  defined 
as  (a)  a  constant  or  variable 4  (b)  a  unary  operator  followed 
by  a  well  formed  formula,  or  (c)  c  binary  operator  followed 
by  an  ordered  pair  of  formulae.  In  the  last  case,  we  have 


followed  the  convention  that  the  operand  which  .would  be  the 
right-hand  operand  in  the  normal  notation  is  expressed  first. 
As  an  example,  if  the  user  states  A  +  (B  +  C),  FDS  will  store 
the  string  ++CBA  •  It  is  important  to  note  that  any  operator 
symbol  will  head  a  substring  which  is  itself  a  well  formed 
formula.  Also,  the  order  of  the  symbols  in  the  FDS  string 
will  unambiguously  specify  a  tree  structure  for  en  expression, 
i.ei,  a  graph  in  which  each  node  represents  an  operator 
or  a  variable  or  constant,  and  those  nodes  which  represent 
operators  have  below  them  subgraphs  specifying  their  operands. 
The  GPS  program  achieves  an  identical  representation  using 
list  processing  techniques. 

A  problem  is  a  coosoand  to  FDS  to  rewrite  a  given 
suffix  string,  called  the  state,  and  corresponding  to  the 
left-hand  side  of  the  problem  in  the  exfcerael  representation, 
into  another  suffix  string,  called  the  goal,  which  corres¬ 
ponds  to  the  right-hand  side  of  the  external  representation. 
The  two  strings  are  compared  symbol  by  symbol.  If  they 
are  identical,  then  the  problem  is  solved.  It  is  retrans¬ 
lated  to  the  external  representation  end  printed.  If  the 
two  strings  are  not  identical,  a  difference  .ggt  is  established 
between  them.  This  is  done  by  starting  at  the  leftmost 
symbol  (the  main  operator  of  she  expression),  and  comparing 
corresponding  symbols,  When  a  difference  is  noted,  the 
pair  of  symbols  which  gave  rise  to  the  difference  are  stored 
on  the  difference  set,  and  comparison  continues,  but  does 
not  include  any  synhole  which  express  operands  of  the  symbols 
which  gave  rise  to  the  difference  pair. 
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An  ex&a pie  stay  help  clarify  this.  Returning  to  the 
sisf>X®  problem  given  above,  wa  see  that  the  internal  repre¬ 
sentation  of  axioms  I  and  2  in  FDS  would  be 
1  •  -frBA  t»  MB 
2g  4C$£A  :*  ++CBA 
^hile  the  problem  la 

4+CBA  :*  +OBA. 

The  first  difference  pair  is  the  pair  (-j^C),  whore  the 
4*  is  the  second  +  in  the  state  (left-hand)  string,  and  the 
€  1$  the  C  in  the  right-hand  3tring«  Since  the  v  begins 
®  substructure  (corresponding  to  (B  *  C)  in  the  external 
problem),  the  substring  ->CB  on  the  left  is  skipped.  Thus, 
the  second  difference  pair  is  (A,  +),  wh^ra  the  4-  is  the 
second  4*  of  the  goal  string.  This  in  turn,  begins  the 
substructure  -i-BA  (A  4 ■  B),  so  the  remainder  of  the  goal 
string  is  skipped*  Therefore,  those  two  difference  parxs 
eoapriae  the  difference  set. 

The  difference  set  states,  in  effect,  what  nasst  bo 
changed  into  what  in  order  for  the  goal  to  be  achieved. 

The  FDS  then  examines  both  the  axioms  and  the  state  and  goal 
strings  in  order  to  determine  a  set  of  pairs  (e9d>,  whore 
c  is  an  axiom  and  d  specifies  a  substrjjrjg  of  the  state 
string*  That  Is,  c  is  a  rewriting  rule  and  d  is  the  place 
in  the  state  string  where  it  will  be  applied.  Lot  us  call 
this  *r  operator «  In  the  example,  the  first:  step  applied 
axiom  1  to  the  entire  string,  while  th©  second  step  applied 
the  same  axiom  to  a  subexpression t 

In\  applying  an  oprsrstor  it.-  may  be  found  that  the  state 


8 


string  (or  substring)  Is  not  in  a  form  appropriate  for  the 
axiom’s  application.  In  this  case  a  check  is  made  to  deter¬ 
mine  how  many  changes  in  th©  state  string  will  be  required 
before  the  axiom  can  be  applied.  If  the  system  decides  to 
apply  an  axiom  which  requires  a  change  of  the  state  string, 
the  subproblem  of  making  this  change  in  the  etato  string 
will  be  attacked s  Those  familiar  with  the  GPS  literature 
will  recognise  this  as  being  very  similar  to  the  "means-end 
analysie"  and  roeur s ive  problem  solving  teehniquo  used  in 
the  Newell,  <at.  al«  pregram,,  Tho  major  difforonco  between 
FDS  and  OPS  at  this  point  is  that  in  FDS  cubprobloms  are 
not  attacked  by  recursive  application  of  the  program,  but 
instead  are  attacked  in  a  way  that  insures  that  when  there 
are  severe!  alternative  solutions  to  a  eubproblem,  an  appro¬ 
priate  solution  for  solving  both  the  subproblcm  and  advancing 
th©  solution  of  th©  main  problem  will  be  found  * 

The  performance  of  the  program  depends  heavily  upon 
its  ability  to  determine  which  operator  to  try  first. 

This  information  is  obtained  by  two  subprograms ,  both  of 
which  are  replaceable  modules  of  tho  main  program.  On© 
subprogram  "locks  .ahoac”  in  the  sense  that  it  determines 
how  many  useful  changes  will  Iks  achieved  by  o  given  operator. 
The  assumption  i.s  made  tbr.t  any  subprobloms  uhiefe  rmy  arise 
can  be  solved,  A  second  subprogram  compares  tho  operations 
rscommendod  by  the  first  subprogram  to  c.  record  of  previously 
successful  application*  of  .axioms.  Each  operator  io  classi¬ 
fied  into  ons  of  twenty  categories,  based  on  the  structure 
of  both  the  axiom  and  the  state  to  which  it  in  to  be  applied. 
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Ths  operator  is  then  assigned  n  rating  determined  by  the 
ratio  of  the  number  of  tines  operators  in  its  class  have 
lead  to  solutions  to  the  total  number  of  times  they  have 
been  tried.  These  frequencies  ere  built  up  as  the  program 
obtains  experience  in  solving  problems  ir.  a  particular  deduc¬ 
tive  system.  Simple  minded  as  the  learning  algorithm  is, 
our  experience  has  been  that  it  is  quite  affective  in  improv¬ 
ing  the  program :c  performance. 

Applications  of  the  FDS. 

W©  have  conducted  a  reasonably  large  number  of  studies 
of  the  program1' 3  performance  in  a  variety  of  areas  of  math¬ 
ematics,  To  give  some  id an  of  the  power  of  the  system,  tho 
results  of  these  studies  will  be  summarized  briefly* 

Algebra,  The  system  ic  s  quite  powerful  manipulator  of 
conventional  algebraic  formulae.  Over  e  hundred  theorems 
have  been  proven  concerning  the  manipulation  of  variables 
under  the  operations  oi  addition,  subtraction,  negation, 
end  multiplication*  The  average  time  per  theorem  is  about 
twenty  seconds. 

We  do  not  present  FDS  as  a  simulation  of  human  thought. 
In  fact,  in  algebra  it  is  a  somewhat  bettor  problem  solver 
than  most  people*  To  test  this  assertion,  we  used  FDS 
to  solve  twelve  problems  based  upon  the  following  restricted 
axioms  of  algebra: 

1 .  A  +  B  :»  B  •>  i. 

2.  (A  +  B)  -  B  ! A 

3.  A  •—  \A  ‘J*  B )  *•  li 

4.  (A  -  B)  +  C  :=-•  (A  C)  -  3 
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5.  A  +  (B  +  C)  :*=  (A  ->  B)  *  C 
6*  (A  +  B)  -  C  Je  (A  -  C)  -3-  B. 

The  same  problems  were  presented  to  thirty  und ©r  gra due te 
psychology  students  cat  the  University  of  Washington*  and 
in  thirty  minutes  they  produced  an  average  of  two  solutions 
each.  The  program  solved  all  twelve  problems  in  less  than 
five  minutes  c  One  problem,  to  prove 
(A  «  C)  »  (B  •  C)  :=  A  -  B, 

was  not  solved  by  any  of  the  students*  Informally  the 
same  problem  was  attested  by  several  professors, .including 
mathematicians  nnd  mathematical  psychologists.  .and  graduate 
students  in  mathematics,  engineering,  and  computer  science* 
Only  one  student,  a  senior  in  m thematic®,  produced  any 
solution,  and  his  was  different  from  that  produced  cy  FDS, 
By  contrast,  soma  problems  on  which  IDS  expended  a  fair 
amount  of  time  (by  its  standards)  proved  quite  easy  for 
people. 

Logic:  FDS  was  used  te  solve  problems  selected  from  the 
exercise  sets  in  an  elementary  school  tTnsw  mathematics” 
text  book,  Suppos  and  Hill's  Introductory. 

(19  ) .  Although  sore  problems  wore  found  for  which  an 

FDS  formulation  was  elraasy*  no  problems  were  found  in  this 
book  which  tha  program  could  not  solve  w 
Tr igonome try :  A  number  of  informs!  studies  i?ero  carried 
out  in  which  tho  program  wrs  used  to  solve  identity  proving 
problems  in  trigonometry,  Th^-se  studies  illxxstraited  er, 
Important  principle,  the  program  is  flexible  enough  so  that 
how  the  user  represontr  the  deductive  system  will  exert 
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a  considerable  influence  on  the  difficulty  of  obtaining 
a  solution*  Several  alternate  representations  are  possible 
for  trigonometry.  Depending  on  which  representation  was 
used,  a  problem  might  be  either  hard  or  easy.  It  has  been 
noted  that  this  occurs  in  other  problem  solving  programs 
(Ernst  and  Newell,  1967)* 

Pattern  identlf lcat Ion i  It  has  been  suggested  that  complex 
patterns,  such  as  pictures  of  houses,  can  be  recognized  as 
examples  of  a  generic  clars  by  using  a  simplification  system 
very  much  like  parsing  in  a  phrase  structure  grammar  (Minsky, 
1963;  Ledley,  1962,  1965).  For  example,  in  such  a  system, 
a  "House"  might;  be  defined  as  a  roof  on  top  of  wall,  where 
wall  could  be  defined  as  wall  or  wall  with  vTindow  or  wall 
with  door  or  wall  beside  vail,  and  roof  *  in  turn,  bo  defined 
as  roof:  or  roof  with  gable  or  roof  with  chimney ,  Even  with 
such  a  simple  "grammar,"  quite  complex  pictures  can  be  drawn* 
When  the  grammar  for  recognizing  picture  classes  is  input 
to  FBS  as  the  deductive  system,  the  program  is  capable  of 
recognizing  very  intricate  patterns  as  examples  of  a  class. 
The  patterns,  of  course,  roust  first  be  coded  into  a  symbolic 
representation  of  a  picture,  as  the  FDS  does  not  at  present 
have  any  capacity  for  graphic  input  of  data.  In  this  appli¬ 
cation  the  learning  procedures  described  proved  noticeably 
and  uniformly  effective. 

Sanderson  algebra:  Sanderson  (1967)  has  developed  an  alge¬ 
braic  representation  for  flow  diagrams  which  might  represent 
simple  computer  programs,;  The  motivation  for  his  work  was 
an  attempt  to  develop  a  deductive  system  which  could  b© 
used  to  prove  that  two  programs,  as  defined  by  their  flow 


charts,  were  in  fact  c<ju iva lent  coapitatloris*  He  proved  ten 
theorems  concerning  program  equivalence  using  this  algebra. 

The  proofs  of  all  ten  theorems  were  reproduced  by  FDS  • 
Inequalities  i  Some  attempts  have,  been  made  to  prove  tha 
theorems  of  the  calculus  of  jnsqualities,  using  previous 
results  from  logic  and  algebra.  In  these  studies  tho  FDS 
has  had  to  work  with  better  than  130  previously  proven  theorems 
While  some  elementary  theorems  have  been  proven,  the  nesc 
to  search  a  very  large  number  of  rewriting  rules  to  deter¬ 
mine  the  next  operation  has  caused  the  program  to  spend  a 
groat  deal  of  time  on  what  seam,  to  humans,  to  b©  reasonably 
easy  problems. 

Qurrept  Status  and  Fatuyo 

Thus  far,  FDS  has  been  applied  only  to  re-prove  well 
known,  and  to  a  mathematician  elementary;,  theorems.  We  hope 
to  extend  this  somewhat  in  the  next  few  years  by  applying 
FDS  to  more  advanced  areas  of  mathematics.  In  this  work 
we  do  not  expect  tho  program  to  oxcecd  the  capacity  of  the 
intelligent  human  iaa  the  vatic  is.  n*  It  may.  however,  serve 
as  a  useful  complement  to  him.  An  interactive  version  of 
FDS  has  been  lamented  on  the  Burroughs  £-5500  remote  ; 
access  system,  and  can  be  ueod  in  stud5.ee  of  man-computer 
problem  solving  teams. 

Considerably  more  work  needs  to  bo  done  on  learning 
algorithms  for  tho  theorem  prover*  Intuitively,  it  would 
seem  that  exporienco  with  theorem  proving  problems  should 
lead  to  the  knowledge  that  certain  rewriting  rules  are 
appropriate  in  given  situations. 


The  question  is,  can  wo 


writs©  ©  program  which  develops  these  rules  of  thumb  for  Itself? 
This  is  closely  related  to  our  interest  in  problem  typology— 
emn  is®  find  classes  of  problems  such  that  certain  problem 
solving  procedures  are  appropriate  for  all  members  of  the 
class? 

Ir.  stasaary,  we  have  developed  a  useful,  reasonably 
powerful  theorem  prover  using  standard  algebraic  computing 
languages •  The  system  is  capable  of  proving  theorems  which 
are  quite  difficult  for  the  average  university  undergraduate > 
but  it  lias  not  yet  produced  any  proofs  which  would  bo  consi¬ 
dered  exceptionally  good  by  a  professional  mathematician. 

This,  of  course,  ia  a  difficult  goal  to  reach,  but  wo  fool 
that  we  have  made  progrsa3  towards  it* 
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A  Fortran  Program  to  solve  generalized  algebra  type  problems  has  been 
developed.  It  is  a  heuristic  program,  designed  to  give  rapid  solutions  for 
problems  which  it  can  solve.  The  program  accepts  as  input  a  definition  of 
an  algebraic  system,  then  solves  problems  in  the  system.  The  program’s 
operation  is  described  here  in  general  terms  and  examples  of  its  operation 
are  given.  a? 
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